a 
u 

W 



SEQUENCE LISTING 

<110> Hoechsi\ Marion Roussel 

<120> HUMAN ht\FIH GENE AND CODED htfUIA PROTEIN 

<130> 9823seq 

<140> 
<141> 

<160> 10 

<170> Patentln Vers .\ 2.0 

<210> 1 
<211> 1273 
<212> ADN 
<213> Human 

<220> 
<221> CDS 

<222> (176) . . (1270) 

<400> SEQ ID No: 1 
atgcgcagca gcggcgccga cgcggggdag tgcctggtga ccgcgcgcgc tcccggaagt 60 

gtgccggcgt cgcgcgaagg ttcagcagg\ agccgtgggc cgggcgcgcc ggttcccggc 120 

acgtgtctcg gcacgtggca gcgcgcctgg Vcctgggctt ggaggcgccg gcgcc ctg 178 

Leu 
1 



09/83U26 



JC08 Rec'd PCT/PT0 0 8MAY200f 



p 
hi 



gat ccg ccg gcc gtg gtc gcc gag teg &tg teg tec ttg acc ate gec 
Asp Pro Pro Ala Val Val Ala Glu Ser vVl Ser Ser Leu Thr He Ala 
5 10 \ 15 

gac gcg ttc att gca gcc ggc gag age tea \ct ccg acc ccg ccg cgc 
Asp Ala Phe He Ala Ala Gly Glu Ser Ser Ala Pro Thr Pro Pro Arg 
20 25 \ 30 

ccc gcg ctt ccc agg agg ttc ate tgc tec ttc Vet gac tgc age gcc 
Pro Ala Leu Pro Arg Arg Phe He Cys Ser Phe pVo Asp Cys Ser Ala 
35 4 0 4T 

aat tac age aaa gcc tgg aag ctt gac gcg cac ctg Vgc aag cac acg 
Asn Tyr Ser Lys Ala Trp Lys Leu Asp Ala His Leu Cys Lys His Thr 
50 55 60 A 65 

ggg gag aga cca ttt gtt tgt gac tat gaa ggg tgt ggc Vag gcc ttc 
Gly Glu Arg Pro Phe Val Cys Asp Tyr Glu Gly Cys Gly lVs Ala Phe 
70 75 \ 80 

ate agg gac tac cat ctg age cgc cac att ctg act cac aca >aga gaa 
He Arg Asp Tyr His Leu Ser Arg His He Leu Thr His Thr Oly Glu 
85 90 95 



226 



274 



322 



370 



418 



466 




aag ccq ttt gtt tgt gca gcc act ggc tgt gat caa aaa ttc aac aca 

Lys PrcAPhe Val Cys Ala Ala Thr Gly Cys Asp Gin Lys Phe Asn Thr 

LOO 105 110 

aaa tea afec ttg aag aaa cat ttt gaa cgc aaa cat gaa aat caa caa 

Lys Ser AsYi Leu Lys Lys His Phe Glu Arg Lys His Glu Asn Gin Gin 

115 \ 120 125 

aaa caa tat ata tgc agt ttt gaa gac tgt aag aag acc ttt aag aaa 

Lys Gin Tyr lie Cys Ser Phe Glu Asp Cys Lys Lys Thr Phe Lys Lys 

130 \ 135 140 145 

cat cag cag ctg \aa ate cat cag tgc cag cat acc aat gaa cct eta 

His Gin Gin Leu l\s lie His Gin Cys Gin His Thr Asn Glu Pro Leu 

150L 155 160 

ttc aag tgt acc cag §aa gga tgt ggg aaa cac ttt gca tea ccc age 

Phe Lys Cys Thr Gin GJSu Gly Cys Gly Lys His Phe Ala Ser Pro Ser 

165 \ 170 175 



aag ctg aaa cga cat gcc 
Lys Leu Lys Arg His Ala 
180 



ag gcc cac gag ggc tat gta tgt caa aaa 
Ala His Glu Gly Tyr Val Cys Gin Lys 
185 190 



gga tgt tec ttt gtg gca aaa \ca tgg acg gaa ctt ctg aaa cat gtg 
Gly Cys Ser Phe Val Ala Lys Thr Trp Thr Glu Leu Leu Lys His Val 
195 200 \ 205 



aga gaa acc cat aaa gag gaa ata 
Arg Glu Thr His Lys Glu Glu He 
210 215 



:a tgt gaa gta tgc egg aaa aca 
*u Cys Glu Val Cys Arg Lys Thr 
220 225 



514 



562 



610 



658 



706 



754 



802 



850 



□ 

yi 
o 
m 



ttt aaa cgc aaa gat tac ctt aag caa &ac atg aaa act cat gcc cca 
Phe Lys Arg Lys Asp Tyr Leu Lys Gin hYs Met Lys Thr His Ala Pro 
230 235 240 



gaa agg gat gta tgt cgc tgt cca aga gaa 
Glu Arg Asp Val Cys Arg Cys Pro Arg Glu 
245 250 



^gc tgt gga aga acc tat 
y Cys Gly Arg Thr Tyr 
255 



898 



946 



act act gtg ttt aat etc caa age cat ate etc t^cc ttc cat gag gaa 

Thr Thr Val Phe Asn Leu Gin Ser His He Leu S&r Phe His Glu Glu 
260 265 \ 270 

age cgc cct ttt gtg tgt gaa cat get ggc tgt ggc &aa aca ttt gca 

Ser Arg Pro Phe Val Cys Glu His Ala Gly Cys Gly L^s Thr Phe Ala 

275 280 285 



994 



1042 



atg aaa caa agt etc act agg cat get gtt gta cat gat 
Met Lys Gin Ser Leu Thr Arg His Ala Val Val His Asp 
290 295 300 



rt gac aag 
Asp Lys 
305 



1090 



aag aaa atg aag etc aaa gtc aaa aaa tct cgt gaa aaa egg kgt ttg 
Lys Lys Met Lys Leu Lys Val Lys Lys Ser Arg Glu Lys Arg S^r Leu 
310 315 32t 



1138 



gcc tct cat etc agt gga tat ate cct ccc aaa agg aaa caa ggg 



1186 



Ala Ser His Leu Ser Gly Tyr He Pro Pro Lys Arg Lys Gin Gly Gin 
325 330 335 

ggc tta tc\ ttg tgt caa aac gga gag tea ccc aac tgt gtg gaa gac 
Gly Leu SerNLeu Cys Gin Asn Gly Glu Ser Pro Asn Cys Val Glu Asp 
340 \ 345 350 

aag atg etc tcAaca gtt gca gta ctt acc ctt ggc taa 
Lys Met Leu Ser\hr Val Ala Val Leu Thr Leu Gly 
355 \ 360 ■ 365 



1234 



1273 



<210> 2 
<211> 365 
<212> PRT 
<213> Human 




<400> SEQ ID No: 2 
Leu Asp Pro Pro Ala Val VaV Ala Glu Ser Val Ser Ser Leu Thr He 
1 5 \ 10 15 

Ala Asp Ala Phe He Ala Ala <ky Glu Ser Ser Ala Pro Thr Pro Pro 
20 \ 25 30 

Arg Pro Ala Leu Pro Arg Arg Phe \le Cys Ser Phe Pro Asp Cys Ser 
35 40 \ 45 

Ala Asn Tyr Ser Lys Ala Trp Lys LeuNAsp Ala His Leu Cys Lys His 
50 55 \ 60 

Thr Gly Glu Arg Pro Phe Val Cys Asp TyrVGlu Gly Cys Gly Lys Ala 
65 70 V5 80 

Phe He Arg Asp Tyr His Leu Ser Arg His Ile\ Leu Thr His Thr Gly 
85 90 \ 95 

Glu Lys Pro Phe Val Cys Ala Ala Thr Gly Cys Asfa Gin Lys Phe Asn 
100 105 \ HO 

Thr Lys Ser Asn Leu Lys Lys His Phe Glu Arg Lys Hiss Glu Asn Gin 
115 120 12T 

Gin Lys Gin Tyr He Cys Ser Phe Glu Asp Cys Lys Lys T^r Phe Lys 
130 135 140 

Lys His Gin Gin Leu Lys He His Gin Cys Gin His Thr Asn^lu Pro 
145 150 155 \ 160 

Leu Phe Lys Cys Thr Gin Glu Gly Cys Gly Lys His Phe Ala SeX Pro 
165 170 175 

Ser Lys Leu Lys Arg His Ala Lys Ala His Glu Gly Tyr Val Cys 
180 185 190 



Lys Gly Cys Ser Phe Val Ala Lys Thr Trp Thr Glu Leu Leu Lys His 
195 200 205 



Val Arg Glu Thr His Lys 
210 



C3 

m 
u 



\ 



C3 



Thr Phe\Lys Arg 
225 

Pro Glu Ar\ Asp 



Lys Asp 
230 

Val Cys 
245 



Glu Glu 
215 

Tyr Leu 
Arg Cys 



lie Leu Cys Glu Val Cys Arg Lys 
220 

Lys Gin His Met Lys Thr His Ala 

235 240 

Pro Arg Glu Gly Cys Gly Arg Thr 

250 255 



Tyr Thr Thr v\l 
26 

Glu Ser Arg Pro 
275 

Ala Met Lys Gin 
290 

Lys Lys Lys Met 
305 

Leu Ala Ser His 



Gin Gly Leu Ser 
340 



Phe Asn Leu Gin Ser His He Leu Ser Phe His Glu 
265 270 



;ie Val 



Cys Glu 
280 

Thr Arg 
295 



His Ala Gly Cys Gly Lys Thr Phe 
285 

His Ala Val Val His Asp Pro Asp 
300 



Asp Lys Met Leu Ser Thr 
355 



/s Val Lys Lys Ser Arg Glu Lys Arg Ser 
315 320 

He Pro Pro Lys Arg Lys Gin Gly 
330 335 

v Gly Glu Ser Pro Asn Cys Val Glu 
15 350 

Va\ Leu Thr Leu Gly 
365 




<210> 3 
<211> 1273 
<212> ADN 
<213> Human 

<400> SEQ ID No: 3 
atgcgcagca gcggcgccga 

gtgccggcgt cgcgcgaagg 

acgtgtctcg gcacgtggca 

tccgccggcc gtggtcgccg 

agccggcgag agctcagctc 

ctccttccct gactgcagcg 

caagcacacg ggggagagac 

cagggactac catctgagcc 

tgcagccact ggctgtgatc 

acgcaaacat gaaaatcaac 



cgcggggcgg 
ttcagcaggg 
gcgcgcctgg 
agtcggtgtc 
cgaccccgcc 
ccaattacag 
catttgtttg 
gccacattct 
aaaaattcaa 
aaaaacaata 



ccgcgcgcgc 
cgggcgtegcc 
ggaggcgo 
atcgccgacg 
cttcccagga 
aagcttgacg 
gggtgtggca 
ggagaaaagc 
aacttgaaga 
tttgaagact 



tcccggaagt 60 
ggttcccggc 120 
gcgccctgga 180 
-gttcattgc 240 
ggVtcatctg 300 
cgcacctgtg 360 
aggcctticat 4 20 
cgtttgttVg 480 
aacattttgA 540 
gtaagaagac 1500 




catcagcagc 
caggaaggat 
cacgagggct 
gaaacatg 
tt\aaacgca 
tgtcV:tgtc 
catatc\tct 
aaaacat 
aagaaaatga 
agtggatata 
gagtcaccca 
taa 



tgaaaatcca 
gtgggaaaca 
atgtatgtca 
tgagagaaac 
aagattacct 
caagagaagg 
ccttccatga 
caatgaaaca 
igctcaaagt 
tc\ctcccaa 
actg\gtgga 



tcagtgccag 
ctttgcatca 
aaaaggatgt 
ccataaagag 
taagcaacac 
ctgtggaaga 
ggaaagccgc 
aagtctcact 
caaaaaatct 
aaggaaacaa 
agacaagatg 



cataccaatg 
cccagcaagc 
tcctttgtgg 
gaaatactat 
atgaaaactc 
acctatacta 
ccttttgtgt 
aggcatgctg 
cgtgaaaaac 
gggcaaggct 
ctctcgacag 



aacctctatt 
tgaaacgaca 
caaaaacatg 
gtgaagtatg 
atgccccaga 
ctgtgtttaa 
gtgaacatgc 
ttgtacatga 
ggagtttggc 
tatctttgtg 
ttgcagtact 



660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1273 



<210> 4 
<211> 1213 
<212> ADN 
<213> Human 

<400> SEQ ID No: 4 

gtgccggcgc cgcgcgaagg ttcagcaggg agccg\gggc 
acgtgtctcg gcacgtggca gcgcgcctgg ccctggg\tt 
tccgccggcc gtggtcgccg agtcggtgtc gtccttgac' 
agccggcgag agctcagctc cgaccccgcc gcgccccgcg 
ctccttccct gactgcagcg ccaattacag caaagcctgg 
caagcacacg ggggagagac catttgtttg tgactatgaa 
cagggactac catctgagcc gccacattct gactcacaca 
tgcagccact ggctgtgatc aaaaattcaa cacaaaatca 
acgcaaacat gaaaatcaac aaaaacaata tatatgcagt 
ctttaagaaa catcagcagc tgaaaatcca tcagtgccag 
caagtgtacc caggaaggat gtgggaaaca ctttgcatca 
tgccaaggcc cacgagggct atgtatgtca aaaaggatgt 
gacggaactt ctgaaacatg tgagagaaac ccataaagag 



!vttc 



cgggcgcgcc 
ggaggcgccg 
atcgccgacg 
xccagga 
aa^cttgacg 
gggtg\ggca 
ggagaaaaqc 
aacttgaag 
tttgaagact 
cataccaatg 
cccagcaagc 
tcctttgtgg 
gaaatactat 



ggttcccggc 60 
gcgccctgga 120 
cgttcattgc 180 
ggttcatctg 240 
cgcacctgtg 300 
aggccttcat 360 
cgtttgtttg 420 
aacattttga 480 
gVaagaagac 54 0 
aacVctatt 600 
tgaaaoaaca 660 
caaaaacaStg 720 
gtgaagtatcX 780 



ccggaaaacJ^ tttaaacgca 
aagggatgta\tgtcgctgtc 
tctccaaagc oatatcctct 
tggctgtggc aaaacatttg 
tcctgacaag aaga\aaatga 
ctctcatctc agtgga\ata 
tcaaaacgga gagtcacc\^a 
tacccttggc taa 



aagattacct 
caagagaagg 
ccttccatga 
caatgaaaca 
agctcaaagt 
tccctcccaa 
actgtgtgga 



taagcaacac 
ctgtggaaga 
ggaaagccgc 
aagtctcact 
caaaaaatct 
aaggaaacaa 
agacaagatg 



atgaaaactc 
acctatacta 
ccttttgtgt 
aggcatgctg 
cgtgaaaaac 
gggcaaggct 
ctctcgacag 



atgccccaga 840 
ctgtgtttaa 900 
gtgaacatgc 960 
ttgtacatga 1020 
ggagtttggc 1080 
tatctttgtg 1140 
ttgcagtact 1200 
1213 




<210> 5 
<211> 34 
<212> ADN 
<213> Human 

<400> SEQ ID No: 5 
cggggtacca aaaatgcgca gcagcggcc 



<210> 6 
<211> 21 
<212> ADN 
<213> Human 

/<400> SEQ ID No: 6 
tccttccctg actgcagcgc c 



<210> 7 
<211> 20 
<212> ADN 
<213> Human 

<400> SEQ ID No: 7 
tgcacaggtg cgcgtcaagc 



cgac 



34 



21 



20 



<210> 8 
<211> 20 
<212> ADN 
<213> Human 

<400> SEQ ID No: 8 
cacaaacaaa tggtctctcc 



20 



<210> 9 
<211> 30 
<212> ADN 
<213> Human 





<400> SEQ ID N> 
cggtctagat tagci! 



<210> 10 
<211> 30 
<212> ADN 
<213> Human 



saggg taagtactgc 




<40O^ SEQ ID No: 10 
ccifcccgggg ccaagggtaa gtact<! 



<BZ* 

u 



1 



• 09/831426 

JC08 Rec'd PCT/PTO 0 8 MAY 200T 



SEQUENCE LISTING 

<110> Hoechst Marion Roussel 

<120> Human htFIIIA gene and coded htfUIA protein 

<130> 9823seq 

<140> 
<141> 

<160> 10 

<170> Patentln Vers. 2.0 

<210> 1 
<211> 1273 
<212> DNA 
<213> Human 

bar 

<220> 
m' <221> CDS 

yj <222> (176) . . (1270) 

^ <4 00> 1 

4« atgcgcagca gcggcgccga cgcggggcgg tgcctggtga ccgcgcgcgc tcccggaagt 60 

fU 

fn gtgccggcgt cgcgcgaagg ttcagcaggg agccgtgggc cgggcgcgcc ggttcccggc 120 

* acgtgtctcg gcacgtggca gcgcgcctgg ccctgggctt ggaggcgccg gcgcc ctg 178 

£J Met 

Ul 1 
r 



□ 

L4 



gat ccg ccg gcc gtg gtc gcc gag teg gtg teg tec ttg acc ate gec 226 
Asp Pro Pro Ala Val Val Ala Glu Ser Val Ser Ser Leu Thr lie Ala 
5 10 15 



gac gcg ttc att gca gcc ggc gag age tea get ccg acc ccg ccg cgc 274 
Asp Ala Phe He Ala Ala Gly Glu Ser Ser Ala Pro Thr Pro Pro Arg 
20 25 30 



ccc gcg ctt ccc agg agg ttc ate tgc tec ttc cct gac tgc age gcc 322 
Pro Ala Leu Pro Arg Arg Phe He Cys Ser Phe Pro Asp Cys Ser Ala 
35 40 45 

aat tac age aaa gcc tgg aag ctt gac gcg cac ctg tgc aag cac acg 370 
Asn Tyr Ser Lys Ala Trp Lys Leu Asp Ala His Leu Cys Lys His Thr 
50 55 60 65 

ggg gag aga cca ttt gtt tgt gac tat gaa ggg tgt ggc aag gcc ttc 418 
Gly Glu Arg Pro Phe Val Cys Asp Tyr Glu Gly Cys Gly Lys Ala Phe 
70 75 80 

ate agg gac tac cat ctg age cgc cac att ctg act cac aca gga gaa 466 
He Arg Asp Tyr His Leu Ser Arg His He Leu Thr His Thr Gly Glu 
85 90 95 

aag ccg ttt gtt tgt gca gcc act qgc tgt gat caa aaa ttc aac aca 514 
Lys Pro Phe Val Cys Ala Ala Thr Guy Cys Asp Gin Lys Phe Asn Thr 
100 105 | 110 




aaa tea aac ttg aag aaa cat ttt gaa cgc aaa cat gaa aat caa caa 
Lys Ser Asn Leu Lys Lys His Phe Glu Arg Lys His Glu Asn Gin Gin 
115 120 125 



562 



aaa caa tat ata tgc agt ttt gaa gac tgt aag aag acc ttt aag aaa 610 

Lys Gin Tyr He Cys Ser Phe Glu Asp Cys Lys Lys Thr Phe Lys Lys 

130 135 140 145 

cat cag cag ctg aaa ate. cat cag tgc cag cat acc aat gaa cct eta 658 

His Gin Gin Leu Lys He "His Gin Cys Gin His Thr Asn Glu Pro Leu 
150 155 160 

ttc aag tgt acc cag gaa gga tgt ggg aaa cac ttt gca tea ccc age 706 

Phe Lys Cys Thr Gin Glu Gly Cys Gly Lys His Phe Ala Ser Pro Ser 
165 170 175 

aag ctg aaa cga cat gec aag gec cac gag ggc tat gta tgt caa aaa 754 

Lys Leu Lys Arg His Ala Lys Ala His Glu Gly Tyr Val Cys Gin Lys 
180 185 190 

gga tgt tec ttt gtg gca aaa aca tgg acg gaa ctt ctg aaa cat gtg 802 

Gly Cys Ser Phe Val Ala Lys Thr Trp Thr Glu Leu Leu Lys His Val 
195 200 205 

Q 

MJ aga gaa acc cat aaa gag gaa ata eta tgt gaa gta tgc egg aaa aca 850 

P Arg Glu Thr His Lys Glu Glu He Leu Cys Glu Val Cys Arg Lys Thr 

yj 210 215 220 225 

^ ttt aaa cgc aaa gat tac ctt aag caa cac atg aaa act cat gec cca 898 

4~ Phe Lys Arg Lys Asp Tyr Leu Lys Gin His Met Lys Thr His Ala Pro 

HJ 230 235 24 0 

ni 

■Sf " 

s gaa agg gat gta tgt cgc tgt cca aga gaa ggc tgt gga aga acc tat 94 6 

JU Glu Arg Asp Val Cys Arg Cys Pro Arg Glu Gly Cys Gly Arg Thr Tyr 

Zt 245 250 255 

y i 

P act act gtg ttt aat etc caa age cat ate etc tec ttc cat gag gaa 994 

p Thr Thr Val Phe Asn Leu Gin Ser His He Leu Ser Phe His Glu Glu 
r\ 2 60 2 65 27 0 

r ~ age cgc cct ttt gtg tgt gaa cat get ggc tgt ggc aaa aca ttt gca 1042 

Ser Arg Pro Phe Val Cys Glu His Ala Gly Cys Gly Lys Thr Phe Ala 

275 280 285 , 

atg aaa caa agt etc act agg cat get gtt gta cat gat cct gac aag 1090 

Met Lys Gin Ser Leu Thr Arg His Ala Val Val His Asp Pro Asp Lys 

290 295 300 305 

aag aaa atg aag etc aaa gtc aaa aaa tct cgt gaa aaa egg agt ttg 1138 

Lys Lys Met Lys Leu Lys Val Lys Lys Ser Arg Glu Lys Arg Ser Leu 
310 315 320 

gee tct cat etc agt gga tat ate cct ccc aaa agg aaa caa ggg caa 1186 

Ala Ser His Leu Ser Gly Tyr He Pro Pro Lys Arg Lys Gin Gly Gin 
325 330 335 

ggc tta tct ttg tgt caa aac gga gag tea ccc aac tgt gtg gaa gac 1234 

Gly Leu Ser Leu Cys Gin Asn Gly Glu Ser Pro Asn Cys Val Glu Asp 
340 345 350 

aag atg etc teg aca gtt gca gta ctt acc ctt ggc taa 1273 

Lys Met Leu Ser Thr Val Ala Val Leu Thr Leu Gly 
355 360 365 



<210> 2 



<211> 365 
<212> PRT 
<213> Human 

<4Q0> 2 

Met Asp Pro Pro Ala Val Val Ala Glu Ser Val Ser Ser Leu Thr lie 
15 10 15 

Ala Asp Ala Phe lie Ala Ala Gly Glu Ser Ser Ala Pro Thr Pro Pro 
20 25 30 

Arg Pro Ala Leu Pro Arg Arg Phe lie Cys Ser Phe Pro Asp Cys Ser 
35 40 45 

Ala Asn Tyr Ser Lys Ala Trp Lys Leu Asp Ala His Leu Cys Lys His 
50 55 60 

Thr Gly Glu Arg Pro Phe Val Cys Asp Tyr Glu Gly Cys Gly Lys Ala 
65 70 75 80 

Phe He Arg Asp Tyr His Leu Ser Arg His He Leu Thr His Thr Gly 
85 90 95 

— Glu Lys Pro Phe Val Cys Ala Ala Thr Gly Cys Asp Gin Lys Phe Asn 

4?! 100 105 110 

m 

yj Thr Lys Ser Asn Leu Lys Lys His Phe Glu Arg Lys His Glu Asn Gin 

U 115 120 125 

r~ 

5™ Gin Lys Gin Tyr He Cys Ser Phe Glu Asp Cys Lys Lys Thr Phe Lys 

pj 130 135 14 0 

m 

* Lys His Gin Gin Leu Lys He His Gin Cys Gin His Thr Asn Glu Pro 

p 145 150 155 160 

* j Leu Phe Lys Cys Thr Gin Glu Gly Cys Gly Lys His Phe Ala Ser Pro 

D 165 170 175 

rl Ser Lys Leu Lys Arg His Ala Lys Ala His Glu Gly Tyr Val Cys Gin 

T, 180 185 190 

Lys Gly Cys Ser Phe Val Ala Lys Thr Trp Thr Glu Leu Leu Lys His 
195 200 205 

Val Arg Glu Thr His Lys Glu Glu He Leu Cys Glu Val Cys Arg Lys 
210 215 220 

Thr Phe Lys Arg Lys Asp Tyr Leu Lys Gin His Met Lys Thr His Ala 
225 230 235 240 

Pro Glu Arg Asp Val Cys Arg Cys Pro Arg Glu Gly Cys Gly Arg Thr 
245 250 255 

Tyr Thr Thr Val Phe Asn Leu Gin Ser His He Leu Ser Phe His Glu 
260 265 270 

Glu Ser Arg Pro Phe Val Cys Glu His Ala Gly Cys Gly Lys Thr Phe 
275 280 285 

Ala Met Lys Gin Ser Leu Thr Arg His Ala Val Val His Asp Pro Asp 
290 295 300 

Lys Lys Lys Met Lys Leu Lys Val Lys Lys Ser Arg Glu Lys Arg Ser 
305 310 315 320 

Leu Ala Ser His Leu Ser Gly Tyr He Pro Pro Lys Arg Lys Gin Gly 



4 



325 330 335 

Gin Gly Leu Ser Leu Cys Gin Asn Gly Glu Ser Pro Asn Cys Val Glu 
340 345 350 

Asp Lys Met Leu Ser Thr Val Ala Val Leu Thr Leu Gly 
355 360 365 



<210> 3 

<211> 1273 

<212> DNA 

<213> Human 

<400> 3 





atgcgcagca 


gcggcgccga 




t gcctggtga 


ccqcgcgcgc 

3 3 


teceggaagt 


60 




CftOCCCIQCQt 


cgcgcgaagg 


ttcaacaaaa 


aqccqtqqqc 


cgqqcqcqcc 

*-*333 3 3 


qqttcccqqc 


120 




acotatctco 


gcacgtggca 




ccct gggct t 


qqaqqcqccq 


qcqccct qqa 


180 




tccaccaocc 


gtggtcgccg 


aqtcqqtqtc 


gt ccttgacc 


at cgccgacg 


cgttcattgc 


240 


tssr 


aaccaacQaa 


agctcagctc 


cgaccccgcc 


gcqccccgcg 


cttcccagga 


ggttcat ctg 


300 




ct cct t ccct 


gactgcagcg 


ccaattacag 


ca'aagectgg 


aagct tgacg 


cgcacctgtg 


360 


u 


caagcacacg 


ggggagagac 


catttgt ttg 


t gactatgaa 


qqqtqtggca 

3 3 3 3 3 3 


aggcett cat 


420 


rij 


cagggactac 


catctgagcc 


gecacat t ct 


gactcacaca 


qqaqaaaaqc 


cgtttgtttg 


480 


U i 


tocaoccact 


ggctgt gate 


aaaaattcaa 


cacaaaatca 


aacttgaaga 


aacatt t t ga 


540 


p 


acgcaaacat 


gaaaat caac 


aaaaacaata 


t atatgeagt 


ttt gaagact 


gtaagaagac 


600 


£ 1-7 

~ \ 

0 

F-H 

□ 


ct ttaagaaa 


cat cagcagc 


tgaaaatcca 


t cagtgccag 


cataccaatg 


aacct ctatt 


660 


caagtgtacc 


caggaaggat 


gtgggaaaca 


ctt tgeatea 


cccagcaagc 


tgaaacgaca 


720 


tgccaaggcc 


cacgagggct 


atgtatgtca 


aaaaggatgt 


tcctttgtgg 


caaaaacatg 


780 




gacggaactt 


ctgaaacatg 


tgagagaaac 


ccataaagag 


gaaatactat 


gtgaagtatg 


840 




ccggaaaaca 


tttaaacgea 


aagattacct 


taagcaacac 


atgaaaactc 


atgccccaga 1 


900 




aagggatgta 


tgtcgctgtc 


caagagaagg 


ctgtggaaga 


acctatacta 


ctgtgtttaa 


960 




tctccaaagc 


catatcctct 


ccttccatga 


ggaaagcege 


ccttttgtgt 


gtgaacatgc 


1020 




tggctgtggc 


aaaacatttg 


caatgaaaca 


aagtct cact 


aggcatgetg 


ttgtacatga 


1080 




tcctgacaag 


aagaaaatga 


agctcaaagt 


caaaaaatct 


cgtgaaaaac 


ggagtttggc 


1140 




ctctcatctc 


agtggatat a 


t ccctcccaa 


aaggaaacaa 


gggcaaggct 


tatctttgtg 


1200 




tcaaaacgga 


gagtcaccca 


actgtgtgga 


agacaagatg 


ctctcgacag 


ttgeagtact 


1260 




tacccttggc 


taa 










1273 



<210> 4 
<211> 1213 
<212> DNA 
<213> Human 

<400> 4 



gtgccggcgc 


cgcgcgaagg 


tt cagcaggg 


acgtgtctcg 


gcacgtggca 


gcgcgcctgg 


tccgccggcc 


gtggtcgccg 


agtcggtgtc 


agccggcgag 


agct cagctc 


cgaccccgcc 


ctccttccct 


gactgcagcg 


ccaattacag 


caagcacacg 


ggggagagac 


catttgtttg 


cagggactac 


catctgagcc 


gccacat t ct 


tgcagccact 


ggctgtgatc 


aaaaattcaa 


acgcaaacat 


gaaaatcaac 


aaaaacaata 


ctt taagaaa 


catcagcagc 


tgaaaatcca 


caagtgtacc 


caggaaggat 


gtgggaaaca 


tgccaaggcc 


cacgagggct 


atgtatgtca 


gacggaactt 


ctgaaacatg 


tgagagaaac 


ccggaaaaca 


tttaaacgca 


aagattacct 


aagggatgta 


tgt cgctgtc 


caagagaagg 


tctccaaagc 


catatcctct 


ccttccatga 


tggctgtggc 


aaaacatttg 


caatgaaaca 


tcctgacaag 


aagaaaatga 


agctcaaagt 


ctctcatctc 


agtggatata 


tccctcccaa 


tcaaaacgga 


gagtcaccca 


actgtgtgga 


t acccttggc 


taa 





<210> 5 
<211> 34 
<212> DNA 
<213> Human 

<400> 5 

cggggtacca aaaatgcgca gcagcggcgc 

<210> 6 
<211> 21 
<212> DNA 
<213> Human 

<400> 6 

tccttccctg actgcagcgc c 

<210> 7 
<211> 20 
<212> DNA 
<213> Human 



agccgtgggc cgggcgcgcc ggttcccggc 60 
ccctgggctt ggaggcgccg gcgccctgga 120 
gtccttgacc atcgccgacg cgttcattgc 180 
gcgccccgcg cttcccagga ggttcatctg 240 
caaagcctgg aagcttgacg cgcacctgtg 300 
tgactatgaa gggtgtggca aggccttcat 360 
gactcacaca ggagaaaagc cgtttgtttg 420 
cacaaaatca aacttgaaga aacattttga 480 
tatatgcagt tttgaagact gtaagaagac 540 
tcagtgccag cataccaatg aacctctatt 600 
ctttgcatca cccagcaagc tgaaacgaca 660 
aaaaggatgt tcctttgtgg caaaaacatg 720 
ccataaagag gaaatactat gtgaagtatg 780 
taagcaacac atgaaaactc atgccccaga 840 
ctgtggaaga acctatacta ctgtgtttaa 900 
ggaaagccgc ccttttgtgt gtgaacatgc 960 
aagtctcact aggcatgctg ttgtacatga 1020 
caaaaaatct cgtgaaaaac ggagtttggc 1080 
aaggaaacaa gggcaaggct tatctttgtg 1140 
agacaagatg ctctcgacag ttgcagtact 1200 

1213 

1 

cgac 34 



21 





6 



<400> 7 

tgcacaggtg cgcgtcaagc 



20 



<210> 8 
<211> 20 
<212> DNA 
<213> Human 

<400> 8 

cacaaacaaa tggtctctcc 20 



<210> 9 
<211> 30 
<212> DNA 
<213> Human 



<400> 9 

cggtctagat tagccaaggg taagtactgc 



30 



<210> 10 
<211> 30 
<212> DNA 



<213> Human 



<400> 10 

cctcccgggg ccaagggtaa gtactgcaac 



30 
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